- 71 - 

SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANTS: Seed, Brian at al. 

(ii) TITLE OP INVENTION: Targeted Cytolysi. of HIV-infected Cell, by 

Chimeric CD4 Receptor-Bearing Celle 

(iii) NUMBER OF SEQUENCES: 27 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson 

(B) STREET: 225 Franklin Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02110-2804 

(v) COMPUTER READABLE FORM: 

(A) ME DIUM TYPE: 3\5" Diskette, 1.44 Mb 

^ (B) COMPUTER: IBM PS/2 Model 50Z or 55SX 

^ (C) OPERATING SYSTEM: IBM P.c. DOS (Version 3.30> 

(D) SOFTWARE: WordPerfect (Version 5.0) 

Cf 

(▼i) CURRENT APPLICATION DATA: 

~ . (A) APPLICATION NUMBER: 

^ (B) FILING DATE: 

Ul (C) CLASSIFICATION: 

ill 

Cl PRIOR APPLICATION DATA: 

^ (A) APPLICATION NUMBER: 07/847,566 

s (B) FILING DATE: March 6, 1992 

□ (C) CLASSIFICATION: 

to , 

ni < v "> PRIOR APPLICATION DATA: 

s ; (A) APPLICATION NUMBER: 07/665,961 

4= (B) FILING DATE: March 7, 1991 

p (C) CLASSIFICATION: 

1 . 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Clark, Paul T. 

(B) REGISTRATION NUMBER: 30,162 

(C) REFERENCE / DOCKET NUMBER: 00786/212001 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 542-5070 

(B) TELEFAX: (617) 542-8906 
(C) TELEX: 200154 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 1728 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEO, ID NO:l: 





ATOAACCGGGT 


GAGTCCCTTT 


TAGGCACTTO 


CTTCTGGTGC 


TGCAAfTIWlO 


50 




GCTCCTCCCA 


GCAGCCACTC 


AGGGAAACAA 


AGTGGTOCTO 




100 




GGGATACAGT 


GGAACTGACC 


TGTACAGCTT 


CCCAGAAGAA 


GAGCATACAA 


150 




TTCCACTGGA 


AAAACTCCAA 


CCAGATAAAO 


ATTCTGGGAA 


ATCAOOOTTr 


200 




CTTCTTAACT 


AAAGGTCCAT 


CCAAGCTGAA 


TGATCGCGCT 


GACTCAAGJLA 


250 




OAAGCCTTTO 


GGACCAAGGA 


AACTTCCCCC 


TGATCATCAA. 


GAATCTTAAG 


300 




ATAOAAOACT 


CAGATACTTA 


CATCTGTGAA 


GTGGAGGACC 


AGAAGOAOOA 


350 




GGTGCAATTO 


CTAGTGTTCG 


GATTGACTGC 


CAACTCTGAC 




400 




TTCAGGGGCA 


GAGCCTGACC 


CTGACCTTGG 


AGAGCCCCrr 




450 




CCCTCAOTGC 


AATGTAGGAG 


TCCAAGGGGT 


AAAAACATAr 




500 




GACCCTCTCC 


GTGTCTCAGC 


TGGAGCTCCA 




ACCTGGACAT 


550 


OCACTOTCTT 


GCAGAACCAG 


AAGAAGGTGG 


AGTTCAAJL&T 


AUACATCGTQ 


600 


W 


GTOCTAOCTT 


TCCAGAAGGC 


CTCCAGCATA 


GTCTATAAGA 




650 


in 


ACAGGTGGAG 


TTCTCCTTCC 


CACTCGCCTT 


TACAGTTGAA 




700 


lii 


GCAGTGGCGA 


GCTGTGGTGG 


CAGGCGGAGA 


GGGCCTCCTC 


CTCCAAOTCT 


750 


b 


TGGATCACCT 


TTGACCTGAA 


GAACAAGGAA 


GTGTCTGTAA 




800 


CO 


CCAGCACCCT 


AAGCTCCAGA 


TGGGCAAGAA 


GCTCCCGCTC 




850 




TOCCCCAGGC 


CTTGCCTCAG 


TATGCTGGCT 


CTGGAAACCT 




900 




CTTGAAGCGA 


AAA CA GO AAA 


GTTGCATCAG 


GAAGTGAACC 




950 




GAGAGCCACT 


CAGCTCCAGA 


AAAATTTGAC 


CTGTGAGGTG 




1000 




CCTCCCCTAA 


GCTGATGCTG 


AGCTTGAAAC 


TGGAGAACAA 


GGAO/TT'R & & ft 


1050 




GTCTCOAAGC 


GGGAGAAGCC 


GGTGTGGGTG 


CTGAACCCTO 




1100 




GTGGCAGTGT 


CTGCTGAGTG 


ACTCGGGACA 


GGTCCTGCTG 


U. LAAUl 


1150 




TCAAGGTTCT 


GCCCACATGG 


TCCACCCCGG 


TGCACGCGGA 


w LLAAACiC 


1200 




TGCTACTTGC 


TAGATGGAAT 


CCTCTTCATC 




TCATGACAGC 


1250 




CCTGTACCTG 


AGAGCAAAAT 


TCAGCAGGAG 


TGCAGAGACT 


GCTGCCAACC 


1300 




TGCAGGACCC 


CAACCAGCTC 


TACAATGAGC 


TCAATCTAGG 


GCGAAGAGAG 


1350 




GAATATGACC 


TCTTGGAGAA 


GAAGCGGGCT 


CGGGATCCAG 


AOATGOGAGG 


1400 




CAAACAGCAG 


AGGAGGAGGA 


ACCCCCAGGA 


AGGCGTATAC 


AATOCACTGC 


1450 




AGAAAGACAA 


GATGCCAGAA 


GCCTACAGTG 


AGATCGGCAC 


AAAAGGCGAG 


1500 
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AOOCGGAOAO OCAAOCOGCA CGATOCCCTT TACCAOGACA OCCACTTCCA 1550 
AGCAGTGCAG- TTCGGGAACA GAAGAGAGAO AGAAGOTTCA OAACTCACAA 1600 
GGACCCTTGG GTTAAGAGCC CGCCCCAAAO GTGAAAGCAC CCAGCAGAOT 1650 
AGCCAATCCT GTGCCAGCGT CTTCAGCATC CCCACTCTGT GGAGTCCATO 1700 
GCCACCCAGT AGCAGCTCCC AGCTCXAA 

1728 

(2) I NPORMATI ON POR SBQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1389 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSt double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genonic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 





ATGAACCGGG 


GAGTCCCTTT 


TAGGCACTTG 


CTTCTGGTGC TOCAACTGGC 




i z 1 


GCTCCTCCCA 


GCAGCCACTC 


AGGGAAACAA 


AGTGGTGCTG GGCAAAAAAG 


i no 




GGGATACAGT 


GGAACTGACC 


TGTACAGCTT 


CCCAGAAGAA GAGCATACAA 


XDU 


2.3 


TTCCACTGGA 


AAAACTCCAA 


CCAGAXAAAG 


ATTCTGGGAA ATCAGGGCTC 


200 


SI 
s 


CTTCTTAACT 


AAAGGTCCAT 


CCAAGCTGAA 


TGATCGCGCT GACTCAAOAA 


250 


ffs 
i y 


GAAGCCTTTG 


GGACCAAGGA 


AACTTCCCCC 


TGATCATCAA GAATCTTAAG 


300 


ATAGAAGACT 


CAGATACTTA 


CATCTGTGAA 


GTGGAGGACC AGAAOGAGGA 


350 




GGTGCAATTG 


CTAGTGTTCG 


GATTGACTGC 


CAACTCTGAC ACCCACCTGC 


400 


I 


TTCAGGGGCA 


GAGCCTGACC 


CTGACCTTGG 


AGAGCCCCCC TGGTAGTAGC 


450 




CCCTCAGTGC 


AATGTAGGAG 


TCCAAGGGGT 


AAAAACATAC AGGGGGGGAA 


500 




GACCCTCTCC 


GTGTCTCAGC 


TGGAGCTCCA 


GGATAGTGGC ACCTGGACAT 


550 




GCACTOTCTT 


GCAGAACCAG 


AAGAAGGTGG 


AGTTCAAAAT AGACATCGTG 


600 




OTOCTAGCTT 


TCCAGAAGGC 


CTCCAGCATA 


GTCTATAAGA AAGAGGGGGA 


650 




ACAGGTGGAG 


TTCTCCTTCC 


CACTCGCCTT 


TACAOTTGAA AAOCTOACOG 


700 




GCAGTOGCGA 


GCTGTGGTGO 


CAGGCGGAOA 


GGGCTTCCTC CTCCAAOTCT 


750 




TGGATCACCT 


TTGACCTGAA 


GAACAAGGAA 


GTOTCTOTAA AACGGGTTAC 


800 




fccAGGACCCT 


AAGCTCCAGA 


TGGGCAAOAA 


OCTCCCOCTC CACCTCACCC 


850 




TGCCCCAGGC 


CTTGCCTCAG 


TATGCTGGCT 


CTGGAAACCT CACCCTOGCC 


900 




CTTGAAGCGA 


AAACAGGAAA 


GTTGCATCAO 


GAAGTGAACC TGGTGGTGAT 


950 
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OAOAGCCACT CAGCTCCAGA AAAATTTOAC CTOTOAOOTO TOOOOACCCA 
CCTCCCCTAA GCTOATOCTO AGCTTCAAAC TOOAOAACAA OGAOOCAAAO 
OTCTCOAAGC GGOACAAGCC GGTCTGGGTO CTGAACCCTO AGOCGOGGAX 
GTGGCAOTGT CTOCTOAOTO ACTCGCGACA OOTCCTOCTO OAATCCAACA 
TCAAOCTTCT GCCCACATOO TCCACCCCOO TOCACOCOOA TCCOCAOCTC 
TOCTATATCC TGGATGCCAT CCTGTTTTTO TATGGTATTO TCCTTACCCT 
OCTCTACTOT COACTCAAGA TCCAGGTCCO AAAOOCAOAC- ATAOCCAGCC 
GTOAOAAATC AGATGCTGTC TACACGGOCC TOAACACCCO OAACCAGOAO 
ACATATGAGA CTCTGAAACA TGAGAAACCA CCCCAATAO 

(2) INFORMATION POR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1389 





ATGAACCGGG 


GAGTCCCTTT 


TAGGCACTTG 


CTTCTGGTGC 


TGCAACTGGC 


50 


i s 

08 


GCTCCTCCCA 


GCAGCCACTC 


AGGGAAACAA 


AGTGGTGCTG 


GGCAAAAAAG 


100 


fU 


GGGATACAGT 


GGAACTGACC 


TGTACAGCTT 


CCCAGAAGAA 


GAGCATACAA 


150 




TTCCACTGGA 


AAAACTCCAA 


CCAGATAAAG 


ATTCTGGGAA 


ATCAGGGCTC 


200 


i'""" 


CTTCTTAACT 


AAAGGTCCAT CCAAGCTGAA 


TGATCGCGCT 


GACTCAAGAA 


250 




GAAGCCTTTG 


GGACCAAGGA 


AACTTCCCCC 


TGATCATCAA 


GAATCTTAAG 


300 




ATAGAAGACT 


CAGATACTTA 


CATCTGTGAA 


GTGGAGGACC 


AGAAGGAGGA 


350 




GGTGCAATTG 


CTAGTGTTCO 


GATTGACTGC 


CAACTCTGAC 


ACCCACCTGC 


400 




TTCAGGGGCA 


GAOCCTGACC 


CTGACCTTGG 


AGAGCCCCCC 


TGGTAGTAGC 


450 




CCCTCAGTGC 


AATGTAGGAG 


TCCAAGGGGT 


AAAAACATAC 


AGGGGGGGAA 


500 




GACCCTCTCC 


GTGTCTCAGC 


TGGAGCTCCA 


GGATAGTGGC 


ACCTOGACAT 


550 




GCACTGTCTT 


GCAGAACCAG 


AAGAAGGTGG 


AGTTCAAAAT 


AGACATCGTG 


600 




GTOCTAGCTT 


TCCAGAAGGC 


CTCCAGCATA 


GTCTATAAGA 


AAGAGGGGGA 


650 




ACAGGTGGAG 


TTCTC CTTC C 


CACTCGCCTT 


TACAGTTGAA AAGCTGACGG 


700 




GCAGTGGCGA 


GCTGTGGTGG 


CAGGCGGAOA 


GGGCTTCCTC 


CTCCAAGTCT 


750 



y3 



Ft 3 
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TGOATCACCT 


TTGACCTOAA 


GAACAAGGAA 


OTGTCTOTAA 


AACGGOTTAC 


son 


CCAGOACCCT 


AAGCTCCAGA 


TGGGCAAGAA 


GCTCCCGCTC 


CACCTCACCC 


850 


TGCCCCAGOC 


CTTGCCTCAG 


TATGCTGOCT 


CTOOAAACCT 


CACCCTOOCC 


900 


CTTOAAOCGA 


AAACAGGAAA 


GTTGCArCAG 


GAAGTGAACC 


TGOTGGTGAT 


950 


GAOAOCCACT 


CAGCTCCAGA 


AAAATTTGAC 


CTGTGAGGTG 


TGOGGACCCA 


1000 


CCTCCCCTAA 


GCTGATGCTG 


AGCTTGAAAC 


TGGAGAACAA 


GGAGOCAAAO 


1050 


GTCTCGAAGC 


GGGAGAAGCC 


GGTGTGGGTG 


CTGAACCCTG 


. AGOCGGGGAT 


1100 


GTGGCAGTGT 


CTGCTGAGTG 


ACTCGGGACA 


GGTCCTOCTG 


GAATCCAACA 


1150 

A <A J V 


TCAAGGTTCT 


GCCCACATGG 


TCCACCCCGO 


TGCACGCGGA 


TCCCAAACTC 


*. A W V 


TOCTACCTOC 


TGGATGGAAT 


CCTCTTCATC 


TATOGTOTCA 


TTCTCACTOC 


1250 


CTTGTTCCTG 


AGAGTGAAGT 


.TCAGCAGCAO 


CGCAGAGCCC 


CCCGCGTACC 


1300 


AGCAGGGCCA 


GAACCAGCTC 


TATAACGAGC 


TCAATCTAGG 


ACGAAGAGAG 


1350 


GAGTACGATG 


TTTTGGACAA 


GAGACGTGGC 


COGGACCCTO 


AGATGGGGGO 


1400 


AAAGCCGAGA 


AGGAAGAACC 


CTCAGGAAGG 


CCTGTACAAT 


GAACTGCAGA 




AAGATAAGAI 


GGCGGAGGCC 


TACAGTGAGA 


TTGCOATOAA 


AGOCGAGCGC 


1500 


CGOAGGGGCA 


AGGGGCACGA 


TGGCCTTTAC 


CAGOGTCTCA 


GTACAGGCAC 


1550 


CAAGGACACC 


TACGACGCCC 


TTCACATGCA 


GOCCCTGCCC 


CCTCGCTAA 


1599 



gj (2) INFORMATION POR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 
4 s (A) LENGTH: 575 amino acids 

O (B) TYPE: amino acid 

3 y= (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOx4: 

Met Asn Arg Oly Val Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 

Ala Leu Leu Pro Ala Ala Thr Gin Gly Asn Lys Val Val Leu Gly Lys 

Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin LyS Lys Sar 

He Gin Phe Hi. Trp Lys Asn Ser Asn Gin He Lys iJe Leu Oly Asn 

55 60 
Gin Oly ser Phe Leu Thr Lys Oly Pro Ser Lys Leu Asn Asp Arg Ala 

Asp Ser Arg Arg Ser Leu Trp Asp Gin Oly Asn Phe Pro Leu He ill 

90 95 
Lys Asn Leu Ly. He Olu Asp Ser Asp Thr Tyr II. Cys Olu Val Glu 



Asp Oln Lys Olu Glu Val Oln Leu £eu Val Phe Oly Leu Tnr Ala Asa 



115 120 



125 



Ser 


Asp 


Thr 


His 




130 






Ser 


Pro 


Pro 


Gly 


145 






Lys 


Asn 


He 


Gin 


Gin 


Asp 


Ser 


Gly 








1B0 


Val 


Glu 


Phe 


Lys 






195 


Ser 


He 


Val 


Tyr 




210 




Leu 


Ala 


Phe 


Thr 


225 








Gin 


Ala 


Glu 


Ara 


Lys 


Asn 


Lys 


Glu 








260 


Gin 


Met 


Gly 


Lys 






275 




Pro 


Gin 


Tyr 


Ala 




290 




Thr Oly 


Lys 


Leu 


305 






Gin 


Leu 


Gin 


Lys 


Lys 


Leu 


Met 


Leu 








340 


Lys Arg 


Glu 


Lys 






355 


Gin 


Cys 


Leu 


Leu 




370 






Lys Val 


Leu 


Pro 


385 








Cys 


Tyr 


Leu 


Leu 


Ala 


Leu 


Tyr 


Leu 








420 


Asn 


Leu 


Gin 


Asp 






435 




Arg Glu 


Glu 


Tyr 




450 




Met 


Gly 


Gly 


Lys 


465 






Asn 


Ala 


Leu 


Gin 


Thr 


Lys 


Gly 


Glu 








500 


Asp Ser 


His 


Phe 






515 




Gly 


Ser 


Glu 


Leu 




530 






Glu 


Ser 


Thr 


Gin 


555 








Pro 


Thr 


Leu 


Trp 



135 140 
Ser Ser Pro Ser Vtl Ola Cys Arg Ser Pro Arg Gly 

155 160 
Oly Oly Lys Thr Leu S.r Val Ser Oln L. tt Glu Leu 

170 175 
Thr Trp Thr Cy. Thr Val L.u Ola A.n oln Ly. Ly. 

190 

II* A.p II. Val V.l Leu Ala Ph. oln Ly. Ala S.r 

200 205 
Ly. Ly. Glu Gly Glu Gin Val Glu Ph. s.r Ph. Pro 

215 220 
Val Glu Ly. L.u Thr Oly s.r Oly Olu L.u Trp Trp 
JU 235 240 

Ala s.r Ser Ser Ly. Ser Trp 11. Thr Ph. A.p Leu 

Val S.r Val Ly. Arg V.l Thr Oln A.p Pro £" Leu 

265 270 
Lya Leu Pro Le « Hi « L.u Thr Leu Pro Oln Ala Leu 

285 

Gly ser Oly.A.a Leu Thr Leu Ala Leu Olu Ala Ly. 

295 300 
Hi. Oln Olu Val Asn Leu Val Val Met Arg Ala Thr 
10 31 5 320 

A.n Leu Thr Cy. Glu V.l Trp Oly Pro Thr S.r Pro 

330 335 
Ser Leu Lys Leu Glu Asn Ly. Glu Ala Ly. Val Ser 

345 350 
Pro Val Trp V.l L.u A.n Pro Olu Ala Oly Met Trp 

360 365 
S.r Asp Ser Oly Gin Val Leu Leu Olu Ser A.n U. 

- 375 3 80 
Thr Trp S.r Thr Pro V.l Hi. Al. A.p Pro Ly. L.u 

395 Ann 
A«p Oly He Leu Phe He Tyr Oly v.l II. n. Thr 
* 03 410 4 l5 

Arg Ala Ly. Phe S.r Arg Ser Ala Olu Thr Ala Ala 

425 430 
Pro Asn Gin Leu Tyr Asn Glu Leu A.n Leu Gly Arg 

440 445 
Asp Val Leu Glu Ly. Ly. Arg Ala Arg A.p Pro Glu 

455 460 
Gin Gin Arg Arg Arg A.n Pro Gin Olu Gly v.l Tyr 

475 Aon 
Ly. Asp Ly. Met Pro Glu Ala Tyr Ser Olu He Gly 
485 490 495 

Arg Arg Arg Gly Ly. Gly Hi. A.p Oly L.u Tyr Gin 

Gin Ala Val Oln Ph. Oly A.n Arg Arg Olu Arg Ola 

° 525 
=hr Arg Thr Leu Gly Leu Arg Ala Arg Pro Ly. Gly 

535 540 
Gin Ser Ser Gin Ser Cys Ala Ser Val Phe Ser lie 

Ser Pro Trp Pro Pro Ser Ser Ser Ser Gin Leu 
565 570 575 



(2) INFORMATION fOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 amino acids 



(B) TYPE: amino acid 
(D) TOPOLOGY : linear 





(ii) MOLECULE 


TYPE: px 


rotei 




(xi) SEQUENCE 


DESCRIPTION i 


Met 

1 


Asn 


Arg 


Gly 


Val 


Pro 


Phe 


Arg 


Ala 


Leu 


Leu 


Pro 


5 

Ala 


Ala 


Thr 


Gin 








20 










Lys 


Gly Asp 


Thr 


Val 


Glu 


Leu 


Thr 






35 










40 


He 


Gin 


Phe 


His 


Trp 


Lys 


Asn 


Ser 




50 








55 




Oln 


Gly 


Ser 


Phe 


Leu 


Thr 


Lys 


Gly 


65 










70 


Asp 

r 


Ser Arg 


Arg 


Ser 


Leu 


Trp Asp 










85 










Asn 


Leu 


Lys 


He 


Glu 


Asp 


Ser 








100 








Asp 


Gin 


Lys 


Glu 


Glu 


Val 


Gin 


Leu 






115 










120 


Ser 


Asp 


Thr 


His 


Leu 


Leu 


Gin 


Gly 




130 










135 


Ser 


fro 


Pro 


Gly 


Ser 


Ser 


Pro 


Ser 


145 








150 






Lys 


a. ««% 
Asn 


Tl- 
116 


Gin 


Gly 


Gly 


Lys 


Thr 










165 




Oln 


Asp 


Ser 


Gly 


Thr 


Trp 


Thr 


Cys 








180 






Val 


OIU 


Pne 


Lys 


He 


Asp 


He 


Val 






195 








200 


Ser 


HQ 


vai 


Tyr 


Lys 


Lys 


Glu 


Gly 




*1U 








215 


Leu 


Ala 


Phe 


Thr 


Val 


Glu 


Lys 


T a** 

jueu 


225 










230 




Gin 


Ala 


Glu 


Arg 


Ala 


Ser 


Ser 


Ser 










245 








Lvs 


Asn 


Lys 


Glu 


Val 


Ser 


Val 


Lys 








260 








Gin 


Met 


Gly 


Lys 


Lys 


Leu 


Pro 


Leu 






275 










280 


Pro 


Gin 


Tyr 


Ala 


Gly 


Ser 


Gly 


Asn 




290 










295 




Thr 


Gly 


Lys 


Leu 


Eis 


Gin 


Glu 


Val 


305 










310 






Gin 


Leu 


Gin 


Lys 


Asn 


Leu 


Thr 


Cys 










325 






Lys 


Leu 


Met 


Leu 


Ser 


Leu 


Lys 


Leu 








340 








Lys 


Arg 


Glu 


Lys 


Pro 


Val 


Trp 


Val 






355 








360 


Gin 


Cys 


Leu 


Leu 


Ser 


Asp 


Ser 


Gly 




370 








375 


Lys 


Val 


Leu 


Pro 


Thr 


Trp 


Ser 


Thr 


385 










390 






Cys 


Tyr 


He 


Leu 


Asp 


Ala 


He 


Leu 










;os 






Leu 


Leu 


Tyr 


Cys 


Arg 


Leu 


Lys 


He 








420 








Ser 


Arg 


Glu 


Lys 


Ser 


Asp 


Ala 


Val 



SZQ ID NO: 5 j 



His 


Leu 


Leu 


Leu 


Val 


Leu 


Gin Leu 




10 










15 


Gly Asn Lys 


Val 


Val 


Leu 


Gly Lys 


25 










30 


Cys 


Thr 


Ala 


Ser 


Gin 


Lys 


Lys Ser 














Asn 


Gin 


He 


Lys 

60 


lie 


Leu 


Gly Asn 


Pro 


Ser 


Lys 


Leu 


Asn 


Asp 


Arg Ala 






75 








80 


Gin Gly Asn 


Phe 


Pro 


Leu 


He He 




90 










95 


Asp Thr 


Tyr 


He 


Cys 


Glu 


Val Glu 


105 








110 




Leu 


Val 


Phe 


Gly 


Leu 


Thr 


Ala Asn 










125 






Gin 


Ser 


Leu 


Thr 


Leu 


Thr 


Leu Glu 








140 








Val 


Gin 


Cys 


Arg 


Ser 


Pro 


Arg Gly 






155 








160 


Leu 


Ser 


Val 


Ser 


Gin 


Leu 


Glu Leu 


Thr 


170 










175 


Val 


Leu 


Gin 


Asn 


Gin 


Lys Lys 


185 










190 


Val 


Leu 


Ala 


Phe 


Gin 


Lys 


Ala Ser 


Glu 








205 




Gin 


Val 


Glu 


Phe 


Ser 


Phe Pro 








220 








Thr 


Gly 


Ser 


Gly 


Glu 


Leu 


Trp Trp 


Lys 




235 








240 


Ser 


Trp 


He 


Thr 


Phe 


Asp Leu 




250 










255 


Arg Val 


Thr 


Gin 


Asp 


Pro 


Lys Leu 


265 








270 


His 


Leu 


Thr 


Leu 


Pro 


Gin 


Ala Leu 










285 






Leu 


Thr 


Leu 


Ala 


Leu 


Glu 


Ala Lys 








300 






Asn 


Leu 


Val 


Val 


Met 


Arg 


Ala Thr 


Glu 




315 






320 

WAV 


Val 


Trp 


Gly 


Pro 


Thr 


Ser Pro 


Glu 


330 








335 


Asn 


Lys 


Glu 


Ala 


Lys 


Val Ser 


345 










350 




Leu 


Asn 


Pro 


Glu 


Ala 


Gly 


Met Trp 


Gin 








365 




Val 


Leu 


Leu 


Glu 


Ser 


Asn He 








380 








Pro 


Val 


His 


Ala 


Asp 


Pro 


Gin Leu 


Phe 




395 






400 


Leu 


Tyr 


Gly 


He 


Val 


Leu Thr 


Gin 


410 








415 


Val 


Arg 


Lys 


Ala 


Asp 


He Ala 


425 










430 




Tyr Thr Gly 


Leu 


Asn 


Thr 


Arg Asn 
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435 440 445 

Ola Olu Thr Tyr Olu Thr Leu Lys His Olu Lys Pro Pro Oln 
450 455 460 462 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 532 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION t SEQ ID NO: 6: 

Met Asn Arg Gly Val Pro Phe Arg His Leu Leu Leu 

15 10 
Ala Leu Leu Pro Ala Ala Thr Gin Gly Asn Lys Val 

20 25 
Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser 

35 - 40 

He Gin Phe His Trp Lys Asn Ser Asn Gin He Lys 
50 55 60 

Gin Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu 

65 70 75 

Asp Ser Arg Arg Ser Leu Trp Asp Gin Gly Asn Phe 

85 90 
Lys Asn Leu Lys He Glu Asp Ser Asp Thr Tyr He 

100 105 
Asp Gin Lys Glu Glu Val Gin Leu Leu Val Phe Gly 

115 ^ 120 

Ser Asp Thr His Leu Leu Gin Gly Gin Ser Leu Thr 
130 135 140 

Ser Pro Pro Gly Ser Ser Pro Ser Val Gin Cys Aro 
145 150 155 

Lys Asn He Gin Gly Gly Lys Thr Leu Ser Val Ser 

165 170 
Gin Asp Ser Gly Thr Trp Thr Cys Thr Val Leu Gin 

180 185 
Val Glu Phe Lys He Asp He Val Val Leu Ala Phe 

195 200 
Ser He Val Tyr Lys Lys Glu Gly Glu Gin Val Glu 
210 215 220 

Leu Ala Phe Thr Val Olu Lys Leu Thr Gly Ser Gly 
225 230 235 

Gin Ala Glu Arg Ala Ser Ser Ser Lys Ser Trp He 

245 250 
Lys Asn Lys Glu Val Ser Val Lys Arg Val Thr Gin 

260 265 
Gin Met Gly Lys Lys Leu Pro Leu His Leu Thr Leu 

275 280 
Pro Gin Tyr Ala Gly Ser Gly Asn Leu Thr Leu Ala 
290 295 300 

Thr Gly Lys Leu His Gin Glu Val Asn Leu Val Val 
305 310 315 

Oln Leu Gin Lys Asn Leu Thr Cys Glu Val Trp Gly 

325 330 
Lys Leu Met Leu Ser Leu Lys Leu Glu Asn Lys Glu 

340 345 
Lys Arg Glu Lys Pro Val Trp Val Leu Asn Pro Glu 
355 360 
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30 




Gin 


Lys 


Lys Ser 


45 




He 


Leu 


Gly Asn 


Asn 


Asp 


Arg Ala 
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375 380 

Ser Thr Pro Val His Ala Asp Pro Lys Leu 
395 400 
He Leu Phe He Tyr Oly Val He Leu Thr 

410 415 
Lys Pbe Ser Arg Ser Ala Olu Pro Pro Ala 

«5 430 
Oln Leu Tyr Asn Olu Leu Asn Leu Oly Arg 

440 445 
Leu Asp Ljs Arg Arg Oly Arg Asp Pro Olu 
455 460 

Arg Lys Am Pro Gin Olu Oly Leu Tyr Asn 
475 480 
Met Ala Glu Ala Tyr Ser Olu He Oly Met 

490 495 
Oly Lys Oly His Asp Oly Leu Tyr Oln Oly 

505 510 
Asp Thr Tyr Asp Ala Leu His Met Oln Ala 
520 525 



(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



COCGGGGTGA CCGTGCCCTC CAQCAGCTT G GGC 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8t 



CGCGGGGATC CGTCGTCCAG AGCCCGTCCA GCTCCCCGTC CTGGGCCTCA 



(2) INFORMATION POR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



CGCOGCCGGC CGCCACGCCO GCCAAGACAC CAC 



(2) INFORMATION TOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



CGCGTTGACG AGCAGCCAGT TGGGCAGCAO CAG 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LETCTH: IS bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



CGCGGGCGGC CGCTA 



(2) INFORMATION POR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LETC TH: 42 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



CGCGGGCTCO TTATXGAGCT GGTTCTGGCG CTGCTTCTTC 



(2) INFORMATION POR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 48 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



CGCGGGOAOC TCOTTATAOA OCTOOTTTGC COCCGAATTC TTATCCCO 



(2) INFORMATION POR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
„ (A) LENGTH: 33 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



CGCGGGGCGG CCACGCGTCC TCGCCAGCAC ACA 



(2) INFORMATION POR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION t SEQ ID NO: 15: 



CGCGGGACGC GTTTCAGCCG TCCTCGCCAG CACACA 



(2) INFORMATION POR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



CGCGGGACGC GTGACCCTGA GATGGGGGGA AAG 



(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 17: 



CGCGGGACOC GTATTGGGAT GAAAGGCGAG CGC 



(2) INFORMAT I ON POR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



CCCOOATCCC AGCATGGGCA OCTCTT 



(2) INFORMATION POR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CGCGGGGCGG CCGCTTTAGT TATTACTGTT GACATGGTCO TT 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



GCGGGGGGAT CCCACTGTCC AAGCTCCCAG 



(2) INFORMATION FOR SEQ ID NO: 21: 



- 53 - 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: singla 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21i 



GCGGGOOCGO CCGCCTAAAT ACGGTTCTGG TC 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
" (A) LEHGTH: 31 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(zi) SEQUENCS DESCRIPTION: SEQ ID NO: 22: 



y~| TCAGAAAGAG ACAACCTGAA GAAACCAACA A 

UJ (2) INFORMATION POR SEQ ID NO: 23: 

SJ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 31 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



iU 

r- 



(ii) MOLECULE TYPE: nucleic acid 



\ . 5 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TTOTTOOTTT CTTCAOOTTO TOTCTTTCTO A 31 

(2) INFORMATION POR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID .NO: 24: 

Met Glu His Ser Thr Phe Leu Ser Gly Leu Val Leu Ala Thr Leu Leu 

5 10 is 

Ser Gin Val Ser Pro Phe Lys lie Pro He Glu Glu Leu Glu Asp Arg 

20 25 30 

Val Phe Val Asn Cys Asn Thr Ser He Thr Trp Val Glu Gly Thr Val 

35 40 . 45 

Gly Thr Leu Leu Ser Asp He Thr Arg Leu Asp Leu Gly Lys Arg He 

50 55 60 

Leu Asp Pro Arg Gly He Tyr Arg Cyi Asn Gly Thr Asp He Tyr Lys 
65 70 75 80 

Asp Lys Glu Ser Thr Val Gin Val His Tyr Arg Met Cys Gin Ser Cys 

85 90 95 

Val Glu Leu Asp Pro Ala Thr Val Ala Gly He He Val Thr Asp Val 

100 ^ 105 no 

Ala He Thr Leu Leu Leu Ala Leu Gly Val Phe Cys Phe Ala Gly His 

115 120 125 

Glu Thr Gly Arg Leu Ser Gly Ala Ala Asp Thr Gin Ala Leu Leu Arg 

130 135 140 

Asn Asp Gin Val Tyr Gin Pro Leu Arg Asp Arg Asp Asp Ala Gin Tyr 
145 150 155 160 

Ser His Leu Gly Gly Asn Trp Ala Arg Asn Lys 
165 170 

(2) INFORMATION POR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Glu Gin Gly Lys Gly Leu Ala Val Leu He Leu Ala He He Leu 

5 10 is 

Leu Gin Gly Thr Leu Ala Gin Ser He Lys Gly Asn His Leu Val Ly« 

20 25 30 

Val Tyr Asp Tyr Gin Glu Asp Gly Ser Val Leu Leu Thr Cys Asp Ala 

35 40 45 

Glu Ala Lys Asn He Thr Trp Phe Lys Asp Gly Lys Met He Glv Phe 
50 55 60 
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Lys Lys Lys Trp Asn Leu Oly Ser Asn Ala 1 

70 75 
Tyr Oln Cys Lys Cly Ser Oln Asn Lys Ser L y . 

Tyr Arg Met Cys Cln Asn Cys lie Olu Leu Asn Ala 

105 110 
Oly Phe Leu Phe Ala Olu He Val Ser He Phe Val 

120 125 
Val Tyr Phe He Ala Oly Oln Asp Oly Val Arg Oln 

135 140 
Asp Lys Oln Thr Leu Leu Pro Asn Asp Oln Leu Tvr 

. 150 155 160 

Asp Arg Olu Asp Asp Gin Tyr Ser His Leu Oln Oly 
165 170 175 

Arg Asn 
180 



(2) INFORMATION POR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 amino acids 

(B) TYPE: amino acid 
^ (D) TOPOLOOY: linear 

iO (ii) MOLECULE TYPE: amino acids 

UJ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 J 
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(2) INPX>RMATION POR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 aaino acids 

(B) TYPE t amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE! aaino acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
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